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Abstract 

The controlled-not gate and the single qubit gates are considered elementary gates in 
quantum computing. It is natural to ask how many such elementary gates are needed 
to implement more elaborate gates or circuits. Recall that a controlled-U gate can 
he realized with two controlled-not gates and four single qubit gates. We prove that 
this implementation is optimal if and only if the matrix U satisfies the conditions 
trU =/: 0, triUX) ^ 0, and detU ^ 1. We also derive optimal implementations in 
the remaining non-generic cases. 

1 Introduction 

It was shown in the seminal paper |j^] that any unitary 2" x 2" matrix 
M can be realized on a quantum computer with n quantum bits by a finite 
sequence of controlled-not and single qubit gates. We will refer to controlled- 
not and single qubit gates as elementary gates. It is natural to ask how 
many elementary gates are necessary and sufficient to realize a given unitary 
matrix M. Answering such questions is a notoriously difficult task. 

It was shown in Q that a controlled unitary operation can be realized 
with at most six elementary gates, that is, given a unitary 2x2 matrix U, 
there exist unitary matrices A, B, C, and E such that 



U 



E 
A 



B 



C 



(1) 

Our main result shows that this implementation is optimal: 

Theorem A Suppose that U is a unitary 2x2 matrix satisfying det U ^ 1, 
iiU 7^ 0, and iiUX ^ 0. Then six elementary gates are necessary and 
sufficient to implement a controlled-U gate. 
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Notations. We use the following abbreviations throughout this paper: 



We denote by C the field of complex numbers, and by R the field of real 
numbers. We will say that a unitary matrix U is generic if and only if the 
conditions det U ^ 1, trU ^ 0, and tr UX ^ are satisfied. Notice that the 
inverse of a generic unitary matrix U is again generic. 

Theorem A gives a sharp lower bound on the number of elementary 
gates that are needed to implement a generic controlled-?/ operation. The 
non-generic case is discussed in Theorem B in Section 0. 



2 Proof of Theorem A 

We will show that any implementation of a generic controlled- ?7 operation 
requires at least six elementary gates. We classify the possible implementa- 
tion in terms of the number of controlled-not operations used. We will use 
entanglement properties to rule out various potential implementations. The 
following simple fact will turn out to be particularly helpful: 

Lemma 1 Let \ip), |(/>) be nonzero elements ofC^. The input lip) |(/>) to 
a controlled-U gate will produce an entangled output state if and only if 
is not an eigenvector of U and = a|0) + h\l) with a,b ^ 0. 

Proof The input {ip) (g) = (a|0) + 6|1)) (g> to the controlled-?/ gate 
produces the result 

\rout) =a\O)^\(j)) + b\l)0U\4>) 

Denote by \4>'^) a nonzero vector in satisfying {(l)-^\(f>) = 0. Consequently, 
U\(j)) = c\cl)) + die/)-*-) with c, d G C, and d / 0. Therefore, the output state 
{fout) can be expressed in the form 

Kut) = a|0) 10) +6c|l) \cp)+bd\l) \^^) (2) 

with a,b,d ^ 0. Seeking a contradiction, we assume that \rout) is not 
an entangled state. This would mean that there exist complex coefficients 
a, P, 7, 6 such that 

Kut) = (a|O)+/3|l))0(7|<A)+5|<^^)) 

= a7|0) \(P) + a6\0) {(f)^) + (3j\l) ® {(j)) + (35\l) 



2 



Comparing coefficients with shows that a6 = 0, hence a or 6 has to be 
zero. Either choice leads to a contradiction. 

On the other hand, if is a multiple of |0) or of |1) or if \(p) is an 
eigenvector of U, then it follows from the definitions that the output of the 
controlled-C/ gate will not be entangled. □ 

Corollary 2 Assume that \(p) is an eigenvector of U with eigenvalue X^j), 
and a state G C^. // we input ®\(t>) to the controlled-U gate, then 
the output is of the form diag(l, A,^)!?/;) ® In particular, the output is 
not entangled. 

Another simple consequence of this lemma is that a controlled- ?7 gate is 
able to produce an entangled output state from some input state jV') ® !</>), 
as long as U is not a multiple of the identity matrix. Any matrix U with 
tr(C/X) 7^ satifies this condition. In particular, we need at least one 
controlled-not gate to implement a controlled-?/ gate with tT{UX) ^ 0. 



One ControUed-Not Gate. We consider now possible implementations 
of a generic controlled-C/ gate with only one controlled-not gate and some 
single qubit gates. Recall that it is possible to switch the control and the 
target qubit of a controlled-not gate by conjugation with Hadamard matrices 
H: 



H 



H 



H 



H 



(3) 

Therefore, we can assume without loss of generality that the controlled-C/ 
gate is expressed in the following form: 



U 



A, 



B2 



(4) 



Lemma 3 The unitary matrices Ai,A2 used in the single qubit gates in (0j 
have to be both diagonal or both antidiagonal. 

Proof. Suppose that we input \i) ® |a), where i = 0, 1 and |q) is some ar- 
bitrary vector in such that Bi\a) is not an eigenvector of X. Lemma || 
shows that the output of the controlled-C/ gate is not entangled when pro- 
vided with such an input state, since the most significant qubit is not in 
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superposition. Notice that the circuit on the right hand side in (^) will pro- 
duce an entangled output unless Ai is diagonal or antidiagonal. It follows 
that Ai is of the desired form. The same argument applied to the inverse 
circuits proves that A2 has to be diagonal or antidiagonal. It is clear that 
Ai and A2 are either both diagonal or both antidiagonal, because |00) has 
to be an eigenstate of the circuit □ 

We can assume that Ai and A2 are diagonal. Indeed, if the Ai^s are 
antidiagonal, then we can replace the controlled-not gate in (HI) by 



X 



X 



X 



(5) 



Here we used the fact that XAi and A2X will be both diagonal, when Ai,A2 
are antidiagonal. 

Lemma 4 IftrU 7^ 0, then the circuit ^ cannot implement a controlled-U 
operation. 



Proof. The preceding discussion shows that Ai and A2 are both diagonal or 
antidiagonal, and we may assume that Ai and A2 are diagonal. Thus, there 
exist real numbers i^o ^ind t?i such that j42^i|0) = e*''"|0) and A2^i|l) = 
e*'^i|l). It follows that B2B1 = e-'''^H and B2XB1 = e'^^W . Consequently, 
b\xBi = e*(''o~''i)C/, which implies tr?7 = 0. Therefore, it is in general not 
possible to implement a controlled-C/ operation with one controlled-not gate 
and several single qubit gates. □ 



Two Controlled-Not Gates. Assume that we have now two controlled- 
not gates and several single qubit gates at our disposal. This allows to 
express the controlled-f/ gate in the form 



U 



Ai 



Bi 



Ao 



Bo 



A^ 



B3 



(6) 



In fact, any implementation of a controlled- [/ gate with two controlled-not 
gates can be reduced to this form. Indeed, it is possible to swap the control 
and target qubits of a controlled-not gate by conjugation with Hadamard 
gates, as we have seen in our discussion of the previous case. We will see 
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what kind of properties have to be satisfied by the matrices Ai and Bi. The 
following Lemmas will prepare us to prove Proposition |l^. 

We say that a unitary 2x2 matrix V is sparse if and only if V is diagonal 
or antidiagonal. 

Lemma 5 // the matrix A\ in is sparse, then A2 , A^ are sparse as well. 

Proof. Suppose that Ai is diagonal. Choose a state |0) (g) bIbIIiP) as input, 
where \ip) is not an eigenstate of X. Notice that a controlled-f/ operation 
leaves the input |0) (8) IV') invariant. Consider now the evolution of the 
input state through the circuit on the right hand side of @. Because Ai 
is diagonal, the resulting state after the first controlled-not operation is 
a\0) (S) Bl\ip), where a is a scalar phase factor. Applying the single qubit 
operations A2 and B2 yields q;j42|0) (gi Lemma || shows that the output 
after the second controlled-not operation will be entangled, unless A2 is 
sparse. Therefore, A2 has to be sparse. Thus, the state after the second 
controlled-not is of the form P\i)<^\'ip') , where P is some phase factor, i = 0,1, 
and \ip') is some element of C^. Hence A^ has to map \i) to |0), up to a 
phase factor, i.e., A3 has to be sparse. 

If Ai is antidiagonal, then we can use the identity (^) to replace the 
antidiagonal matrix Ai by the diagonal matrix XAi, which allows us to 
conclude that A2X, hence A2, and A^ are sparse. □ 

Lemma 6 Suppose that U is not a multiple of the identity matrix. If Ai in 
the circuit ^ is not sparse, then A2,A3 are not sparse either. 

Proof. Assume that the input state is of the form ^||0) (8> 14^), where is 
not an eigenvector of U. Since Ai is not sparse, ^||0) = a|0) -|- with 
a, 6 / 0. Therefore, the input A\\0) to the controlled-C/ operation will 
yield an entangled output state, according to Lemma ||. 

On the other hand, consider the right hand side of (|6|). The input state 
A\\0) (g) lifj) produces after the first controlled-not gate a state of the form 
|0) (8) -BilV')- The input to the second controlled-not gate is then A2\0) 
B2Bi\ip). Lemma |^ shows that the output of the second controlled-not 
operation cannot be entangled, unless A2 is not sparse. Therefore, A2 is not 
sparse. However, ^3 cannot be sparse either, because this would imply that 
A2 and Ai are sparse, as can be seen by applying Lemma ^ to the inverse 
circuit. □ 
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Lemma 7 Let U be a unitary 2x2 matrix. Assume that Ai,A2,A3 in ^ 
are sparse. IftrU ^ 0, then B2 7^ H . Ifti(UX) 7^ 0, then none of the 
matrices Bi,B2,Bj, can he equal to an identity matrix, and Bi^B-^ cannot 
both he equal to H . 

Proof. Comparing the result of the inputs |0) ® \tp) and |1) ® on the left 
and right hand side of @ yields 

e*^oj = BsX''B2X^Bi, (7) 

e^^i U = B3X^-''B2X^-^Bi , (8) 

for some k,i€ {0, 1}. Notice that equation (0) implies B2 = e''^^ x'' bIbIx^ . 
Substituting B2 in (P) yields 

U = c'^^^-^^^B^XBIbIxBi. (9) 

Step 1. We show that Bii^I ior i = 1, 2, 3: 

i) Suppose that Bi = /. Equation (^) implies ii{UX) = 0, contradicting 
our assumptions. 

ii) Suppose that B2 = I. Equations (@) and (|) then imply U = e'^^o-Oi) j ^ 
thus tr(UX) = 0, which contradicts our assumptions. 

iii) Suppose that ,63 = /. Equation (|9|) implies tr{XU) = 0, whence 
tv{UX) = 0. This contradicts our assumptions. 

Step 2. We show that B2 7^ H. Seeking a contradiction, we suppose that 
B2 = H. From (0) and (|), U = e'^^''~'^^^ B^X^"^ Z X^ bI, which implies 
tr(C/) = 0. Contradiction. 

Step 3. The case Bi = B^ = H immediately leads to a contradiction, 
because @j would imply U = e*(^o-^i)/, and thus ti{UX) = 0. □ 

Lemma 8 // det U ^ 1, then at least one of the matrices Ai in is not 
equal to the identity matrix. 

Proof. Seeking a contradiction, we assume that ^i,^2;^3 are identity ma- 
trices. It follows at once that B3B2B1 = e^'t'I, and B3XB2XB1 = e^'^U, 
hence det C/ =1. It follows that one of the matrices Ai has to differ from 
the identity matrix. □ 

Lemma 9 // the matrices Ai,A2,A3 in ^ are not sparse, then B2 ^ H 
and bI\ujo) and -B||a;i) are eigenvectors of U , where 

|a;o) = (|0) + |1))/^ and = (|0) - |1))/V2 

are eigenvectors of X. 
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Proof. Consider the input \i) Bl\LUk), with i,k = 0,1. Note that the 
controhed-JJ operation does not produce an entangled output state when 
provided with such an input. On the other hand, consider the evolution of 
these states in the circuit (P). The first two single qubit operations yield 
the state Ai\i) |a;fc). The controlled-not operation produces the state 
Z^Ai\i) \u}k), where we have used the fact that \u>k) is an eigenvector of X 
with eigenvalue (—1)^- The result of the next two single qubit operations is 
then A2Z^ Ai\i) B2\uJk) . Notice that the matrix A2Z'^Ai cannot be sparse, 
because this would imply that A^ is sparse. In other words, the input to 
the second controlled-not gate is a state of the form (aj|0) + i?2|'^fc) 

with ai,hi 7^ 0. Since the circuit implements a controlled-f/ operation, this 
gate should not produce an entangled output state. Therefore, -621"^*;) has 
to be an eigenvector of X. However, this means that the input IV') ® B\\ojk) 
to (^) does not get entangled for arbitrary states Consequently, B\\ojk) 
has to be an eigenvector of U by Lemma |l[ 

The previous discussion showed that i?2|wfc), /c = 0, 1, has to be an 
eigenvector of X, i.e., is mapped to a multiple of \uj() for ^ = 0,1. In 
particular, B2 cannot be the Hadamard matrix H. □ 

Lemma 10 If the matrices Ai,A2,A^ in the circuit are all nonsparse, 
then either A^A2Ai or A3ZA2A1 is a diagonal matrix. In particular, it is 
not possible that Ai = vlj+i = H for i = 1,2. 

Proof. Recall that -B||<^o) is an eigenstate of U, where l^o) = (|0) + |l))/\/2, 
as is shown in Lemma ^. The circuit on the right hand side of (^) maps the 
input |0) O^Jl^o) to A3Z''A2Ai\0)(g)B3B2\LOo), where £ = if ^2 maps I Wo) 
to a multiple of itself, and 1 = 1 otherwise. Comparing this state with the 
supposed output state |0) -B||a;o) shows that ^3Z^^2^i|0) coincides, up 
to a phase factor, with |0). Hence A'^Z^ A2A\ has to be a diagonal unitary 
matrix. The second statement is obvious. □ 

Lemma 11 Let U he a unitary 2x2 matrix with ti(UX) 7^ 0. If Ai, A2, A3 
in ^ are not sparse, then Bi,B2,B3 7^ H, and at least one of the matrices 
Bi,B2, B3 differs from the identity matrix. 

Proof Let |a;o) = (|0) + |1))/V2 and = (|0) - |l))/\/2. Lemma|showed 
that -BIIwo) and -BJIwi) are eigenvectors of U, that is, UBl\uJk) = o.kBl\ujk) 
for A; = 0, 1. Hence BiUB^ = H diag{ao,ai)H. The choice Bi = H would 
force U to be diagonal, which would contradict tT{UX) ^ 0. The same 
argument applied to the inverse circuit proves B3 7^ H. We already know 
that B2 ^ H hy Lemma ^. 
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Seeking a contradiction, we assume that Bi = B2 = = I . A potential 
implementation of the controlled-?/ gate is given by: 



U 



A, 



Ao 



The choice Bi = I implies U = BiUbI = H diag{ao,ai)H, according 
to our discussion above. This special form of U shows that l^o) and \uJi) 
are eigenvectors of U corresponding to the eigenvalues oq and ai. If we 
input a state IV') ^ |i^o)) then Corollary ^ shows that A3A2A1 = diag(l, ao). 
Similarly, we obtain A3ZA2ZA1 = diag(l,ai), considering input states of 
the form \ip) \iOi). 

The determinants of ^3^2^! and A3ZA2ZA1 are the same, hence ao 
has to coincide with ai. However, this implies that U is diagonal, because 
U = Hdiag{ao,ai)H = diag(ao,ao), whence tv{UX) = 0. Therefore, at 
least one of the matrices Bi has to differ from the identity matrix. □ 



A, 



(10) 



Lemma 12 // det U ^ 1, then the circuit (11) cannot implement a controlled- 
U gate. 



B, 



C 



B^ 



(11) 



Proof. Transforming (O) into the form (^) yields Ai = H, A2 = HCH, 
and A3 = H. Lemma |l^ shows that A'^Z^ A2A\ = diag(ao, ai), with £ = 0. 
Therefore, C = diag(ao,ai). A diagonal matrix C satisfies 



C 



C 



It follows that the circuit (11) can be written in the form (|6|) with Ai = I 
for i = 1,2, 3. This contradicts Lemma ^. □ 

Proposition 13 Suppose that U is a unitary 2x2 matrix satisfying tr [/ 7^ 
and tr(UX) ^ 0. Any implementation of a controlled-U gate with two 
controlled-not gates and some single qubit gates needs at least a total of six 
gates provided that det?7 7^ 1, and at least a total of five gates otherwise. 
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Proof. Suppose we are given a fixed control-f/ gate. The implementations 
of a controlled-C/ gate witli two controlled not gates and single qubit gates 
can be classified according to the positions of the target qubits of the two 
controlled-not gates. We will show that any of the four implementation 
types will require at least six elementary gates. 

Case 1. Suppose that the target bit of both controlled-not operations is the 
least significant bit, as shown in (^. 

Suppose that Ai is sparse. We know from Lemma ^ that none of the 
matrices Bi, i = 1,2,3, can be an identity matrix, whence we have a total 
of five or more gates. If det U ^ 1, then Lemma |^ shows that at least one 
of the matrices Ai is not the identity matrix, giving an additional gate. 

Suppose that Ai is not sparse. Then A2 and A3 are not sparse either, 
by Lemma |6[ We know from Lemma 11 that at least one of the matrices 
Si , i?2 , -B3 is not an identity matrix, whence we have a total of at least six 
gates. 

Case 2. Suppose that the first controlled-not gate acts on the most signifi- 
cant bit and the second controlled-not gate acts on the least significant bit. 
So the circuit is of the form: 



C^ 



Co 



Do 



C3 



HCi 



CoH 



DoH 



C3 



^3 



(12) 



We use the circuit on the right hand to show that the circuit on the left 
hand side cannot have less than six elementary gates. 

Assume that HCi is sparse. Then C2H has to be sparse as well, hence 
Ci and C2 cannot be identity matrices. Lemma ^ shows that D3 7^ / and 
D2H ^ H, hence D2 7^ /. Thus we have at least six elementary gates. 

Assume that HCi is not sparse. Then C3 cannot be sparse. Either Ci 
or C2 has to differ from the identity matrix, because Lemma ^ shows that 
HCi and C2H cannot both be equal to H. Lemma ^ shows that HDi 7^ H 
and D2H 7^ H, hence Di,D2 7^ /. Thus we have at least six elementary 
gates. 

Case 3. Suppose that the first controlled-not gate acts on the least significant 
bit and the second controlled-not gate acts on the most significant bit. The 
inverse circuit cannot implement a controlled-C/^ operation with less than 
six elementary gates, because it is of the form discussed in Case 2. 
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Case 4- Finally, suppose that the target qubit of both controlled- not gates 
is the most significant qubit. Thus, the circuit is of the form 



Ci 



Co 



Do 



HCi 






HC2H 








— < 

4 


> 






HD2H 













(13) 



Assume that HCi is sparse, then C^H is sparse as well, thus Ci^Cj, 7^ /• 
Either Di or 1)3 differs from the identity, because Lemma shows that HDi 
and D^H cannot both be equal to H. Futhermore, Lemma |^ shows that 
HD2H 7^ /, hence D2 7^ /. Therefore, we have at least six gates. 



Assume that HCi is not sparse. Lemma |11| shows that HDi ^ H and 
D3H 7^ H holds. Therefore Di,D^ 7^ /. Lemma |6| shows that HC2H 
cannot be sparse, hence C2 7^ I- Therefore, we have at least five gates. If 
det C/ 7^ 1, then Di, C2, D3 cannot be the only nontrivial single qubit gates. 



as Lemma |12| shows, proving that we have at least six elementary gates in 
that case. □ 



More than Two Controlled-Not Gates. Although it is undesirable to 
use more than two controlled-not gates, we need to show (for mathematical 
completeness) that implementations of a controlled-f/ gate with three or 
more controlled-not gates cannot reduce the total number of elementary 
gates. Fortunately, it turns out that the proof of this case is much simpler, 
because an implementation with five or fewer gates can then have at most 
two single qubit gates. Thus, the circuit can be expressed in the following 
form: 



U 



Pi 



A" 



P2 



B 



P:, 



(14) 

where a,b £ {0, 1} determine the target bit of the single qubit gates A and B, 
respectively. The Pi implement permutations of the basis vectors realized 
by controlled-not operations. 

We collect some general observations about implementations of controlled- 
U gates with at most two single qubit gates in Lemma p!^-p!6[ It is then 
shown in Lemma ^^19 that an implementation of a controlled-C/ operation 
with three controlled-not gates and at most two single qubit gates cannot 
exist, when trf/ 7^ and tT{UX) 7^ 0. The remaining cases are simple 
consequences of Lemma |l^. 
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Lemma 14 Let U be a unitary 2x2 matrix. Suppose that there exists an 
implementation of the controUed-U gate with two single qubit gates A and B, 
and some controlled-not gates. Then A is sparse if and only if B is sparse. 

Proof. The input |00) remains unchanged by a controlled-C/ operation. If A 
is sparse, then the state after P2 is of the form a 1 61 60)1 with 61, 69 = 0) 1- This 
state has to be mapped by i? to a state of the form |ciCo), with ci, cq = 0, 1. 
Thus, B has to be sparse. The same argument applied to the inverse circuit 
shows that if B is sparse, then A has to be sparse as weh. □ 

Lemma 15 Let U he a unitary 2x2 matrix. Suppose that there exists an 
implementation of the controlled-U gate with two single qubit gates A and 
B, and some controlled-not gates. If trU ^ and triUX) 7^ 0, then A and 
B cannot be sparse. 

Proof. If A and B are sparse, then the circuit ( [l^ implements a monomial 
matrix. This would imply that U is sparse, contradicting either tr [/ 7^ or 
tr([7X) / 0. □ 

Denote by c(Pi) the number of controlled-not gates used to realize the 
permutation Pi in (|l^). 

Lemma 16 Let U be a unitary 2x2 matrix. If trU ^ and tr{UX) 7^ 0, 
then 0(^2) > in the circuit (fj^j. 

Proof If c(P2) =0, then (|l^ imphes 
" Pi 



The state input |00) will not be changed by the circuit on the left hand side, 
because P| and Pj are merely sequences of controlled-not gates. On the 
other hand, if a 7^ 6, then the circuit on the right hand side would map |00) 
to a superposition of base states, because A and B are not sparse. Thus, 
a = b, and BA has to be a diagonal matrix. However, this would imply 
that ( p!^ is realizing a monomial matrix, which contradicts tr [/ 7^ or 
tr{UX) 7^ 0. Therefore, c(P2) cannot be zero. □ 









— < 


t — 


^3 


- u - 



-A 



l-a. 



B 



l-b- 
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Three Controlled-Not Gates. We assume now that three controlled- 
not gates are used in the drcuit (0), that is, c(Pi) +0(^2) +c(P3) = 3 and 
< c{Pi) < 3. We may assume without loss of generahty that c{Pi) > 0(^3), 
for otherwise we can consider the inverse circuits. Consequently, 0(^3) is 
either or 1. In Lemma 18 we consider the case 0(^3) = 1, and in Lemma ^ 
the case 0(^3) = 0. 

Lemma 17 Let U be a unitary 2x2 matrix. If trU ^ and tr{UX) 7^ 0, 
then 0(^2) < 3 in the circuit nil) with three controUed-not gates. 



Proof. Seeking a contradiction, suppose that c(P2) = 3. The three controlled- 
not operations in P2 must have alternating target qubits, because otherwise 
it would be possible to reduce the number of controlled-not gates. Therefore, 
the circuit (O) can be rewritten in the form: 





X 


B'' 









(16) 

Clearly, this circuit is not able to realize a controlled-?/ operation, because 
it cannot entangle any unentangled input state. □ 

Lemma 18 Let U be a unitary matrix with tvU 7^ and tr{UX) ^ 0. 
// c(P3) = 1, then the circuit ^1^) with three controlled-not gates cannot 
implement a controlled-U operation. 

Proof. We have c(Pi) > c{Pz) and c{P2) > 0, hence c{Pi) = 1 for i = 1, 2, 3. 
The target bit of the controlled-not gate in Pi (or P3) cannot be the least 
significant bit, because this would imply that there exists an implementation 
of a controlled-XC/ (or a controlled-C/X) gate with two controlled-not gates 
and at most four elementary gates. Proposition 13 shows that this is not 
possible. Therefore, the circuit has to be of the form: 



1 



U 







A" 










— < 


» — 



P2 













51^ 


— < 


» 



(17) 



Regardless of the target bit of the controlled-not gate in P2, we get 



U 



Di 






D2 


— < 

4 


) — 




C2 
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for some unitary matrices Ci, C2, Di, If we input |10), then the circuit 
on the left hand side produces an entangled output state. This means Di 
has to be nonsparse, and Ci|0) cannot be an eigenstate of X. The input 
1 00) will not produce an entangled state in the circuit on the left hand side, 
but produces an entangled state on the right hand side. □ 

Lemma 19 Suppose that U is a unitary 2x2 matrix satisfying tr [/ 7^ 
and trUX ^ 0. If c{P'j,) = then the circuit ( fl^ with three controlled-not 
gates cannot implement a controlled-U gate. 



Proof. Since 0(^3) = and 0(^2) = 1,2, we have c{Pi] 
Therefore, the circuit is of the form 



2, 1, respectively. 



U 



Pi 



P2 



B 



l-b 



The permutation Pi is of the form 



Pi 



or 



(18) 



(19) 

because otherwise it would be possible to realize a controlled-XC/ operation 
with less than five operations, which contradicts Proposition [l^. Notice that 



A 



l-a 



We will take advantage of this identity to derive the desired contradiction. 
Realizing that 




and 



we find that the circuit (IT^) can be re-written in the form 



U 









A" 








B" 

51-^ 


4 




>- 


X 


P2 








» — 









(20) 



13 



or 



1 



U 









— < 


t — 









-B 



l-b 



(21) 



depending on the form of Pi shown in (|T9|), respectively. The circuits (12' 



and (21) can both be simphfied to contain at most five elementary gates, 
by reducing the combination of the swap operation with P2 to merely 1 and 
2 controlled-not gates, respectively. This would imply that the controlled- 
XU gate can be realized with at most four elementary gates, contradicting 
Proposition □ 

Proposition 20 Let U he a unitary matrix with tr [/ 7^ and tr([/X) 7^ 0. 
It is impossible to implement a controlled-U gate with two or fewer single 
qubit gates and three controlled-not gates. 



Proof. This follows from Lemma 18-19. □ 



Four or more Controlled-Not Gates. If we have more than three 
controlled-not gates, then an implementation with less than six elementary 



gates is not possible, because of Lemma 15 



Summary. We have shown that a generic controlled-C/ gate cannot be 
implemented with less than six elementary gates. In fact, we could rule out 
implementations based on a single controlled-not gate. Implementations 
with two controlled-not gates are possible, but at least four single qubit 
gates are necessary. The previous discussion showed that this gate count 
cannot be improved by implementations based on three or more controlled- 
not gates. This concludes the proof of Theorem A. □ 

3 Further Ramifications 

Let m{U) denote the minimal number of elementary gates that are needed to 
implement a controlled-C/ gate. We know from Q that m{U) < 6. We have 
shown in the previous section that m{U) = 6 provided that U is generic. 
We will show next that m{U) < 5 when U is not generic. 
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Theorem B Let U be a unitary 2x2 matrix. Let (j) and (po be real numbers 
in the range < (/> < 27r and < 0o < 27r. 

a) If U = I, then m{U) = 0. 

b) If U = e"t'I, then m{U) = 1. 

c) If U = X, then m{U) = 1. 

d) If U = e'^X, then m{U) = 2. 

e) If U = e^'t'^Z, then m{U) = 3. 

/) If trU = 0, det U = -1, U ^ ±X, then m{U) = 3. 

g) If tvU = 0, detC/ / -1, U + e'^'^X, U / e^<^«Z, then m{U) = 4. 

h) If tr UX = 0, trU 0, detU =1, U ^ ±1, then m{U) = 4. 

i) If tr UX = 0,tTU 0, detU^l,U ^ e^'^H, then m{U) = 5. 
j) If det [/ = 1, tr [/ ^ 0, tr C/X / 0, then m{U) = 5. 

Theorem B captures ah non-generic cases. The upper bounds on the number 
of gates are straightforward to see with the help of Table ||. 



Case 



Form 



Circuit 



if tr [/ = 



else if tr(C/X) = 



else if det [7 = 1 



else 



U = e^'f'PXP^ 



U = e^'t'PXP'^X 



U = CXBXA 



U = e^'f'CXBXA 



E 



A 



E 
A 



E 



P 



P 



B 



C 



B 



C 



(CI) 



(C2) 



(C3) 



(C4) 



Table 1: Quantum circuits for the implementation of controlled-f/ gates. 

We formally prove tight upper bounds on m{U) in the following simple 
Lemma. 

Lemma 21 The number m{U) of elementary gates given in the statement 
of Theorem B are sufficient to realize the corresponding controlled-U gates. 
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Proof. The cases a)-c) of Theorem B are obvious. 

Case d) U U = e'^X, then the chxuit (CI) in Table | with P 
E = diag(l,e*'^) implements a controUed-C/ gate, hence m{U) < 2. 

Case e)liU = e'^^Z, then 



/ and 



1 



U 



- A 


H 






H - 




— < 







HA 



H 



with A = diag(l,e''^o)^ hence m{U) < 3. 

Cases f, g) If trC/ = 0, then U is of the form U = e'^PXP^i. Therefore, 
circuit (CI) of Tabled with E = diag(l,e*'^) shows that m{U) < 4, which 
proves the upper bound of g). If in addition detU = —1, then = I in 
(CI), whence m{U) < 3. 

Cases h,i)li tT{UX) = 0, then the matrix U is of the form U = e^'^PXP'^X 
for some unitary matrix P, and G R. Let E = diag(l, e**^). The circuit 
(C2) in Table |l| shows that m{U) < 5, proving the upper bound of i). If in 
addition det [7 = 1, then necessarily E = I, hence m{U) < 4, proving the 
upper bound of h). 

Case j). If detU = 1, then it is possible to realize the controlled-?/ gate in 
the form (C3), as was shown in hence m{U) < 5. □ 

It remains to prove the lower bounds on m{U). The following lemma 
allows to prove the cases a)--f): 

Lemma 22 Suppose that U is a unitary 2x2 matrix. If the controUed-U 
gate can be implemented with one single qubit gate A and some controlled- 
not gates, then U has to be of the form U = e^'^I or U = e^'^'X for some 
G R. Furthermore, A has to be diagonal and can be assumed to be of the 
form A = diag(l, e*"^). 

Proof. The controlled-f7 gate is realized by a circuit of the form 



P 



U 



P2 



where Pi and P2 are permutations realized by controlled-not operations, and 
the target bit of A is selected by a G {0, 1}. The state |00) remains invariant 
under the action of a controlled-C/ gate. Since Pi|00) = |00) = P2IOO), it 
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follows that {A"" ^^~")|00) = |00), whence A has to be a diagonal matrix. 
By multiplying with an irrelevant global phase factor, we can assume that 
A is of the form A = diag(l, e**^). Notice that the phase of exactly two out 
of the four computational base states are changed to e*"^ by A, hence U has 
to be of the stated form. □ 



Lemma 23 Parts a)-f) of Theorem B hold. 



Proof. It is clear that no gate is needed to implement a controlled-identity 
gate, whence a) holds. In 6), only one phase gate is needed to affect the 
phase change, hence h) is true. At least one controlled-not gate is needed in 
the cases c)-f), because U has two different eigenvalues a and —a. We have 
m{U) > 2 in cases d)~f), because C/ / X. liU = e*'^X, then m{U) = 2 by 
Lemma ^T], which proves d) . 

We have U = e^'^^Z in case e). This gate can affect a phase change, hence 
at least one single qubit gate is needed. Lemma |2^ shows that circuits with 
one single qubit gate and controlled-not gate cannot implement U = 6*"^°^. 
Therefore, another single qubit gate is needed, that is, m(e*'^°Z) > 3, whence 
m{e^'^°Z) = 3 by Lemma 21. 



In case /), C/ is a unitary matrix with trU = 0, detU = —1, and 
U 7^ zizX. We know that m(U) > 2. Two controlled-not gates cannot 
implement such a gate because of the determinant condition. Lemma ^ 
shows that a single qubit gate and a controlled-not gate cannot implement U. 
Therefore, m(U) > 3, hence m{U) = 3 by Lemma 21. □ 



The remaining cases need a little bit more work. The next lemma gives 
some partial information about circuit with two single qubit gates and some 



controlled-not gates. Lemma 14 showed that A and B are either both sparse 



or both non-sparse. The sparse case is covered by the following lemma: 



Lemma 24 Suppose that U is a unitary 2x2 matrix. If the controlled-U 
gate can he implemented with two sparse single-qubit gates A and B, and 
some controlled-not gates, then the matrix U has to be of the form U = e**^/, 
U = e^'t'X, U = diag(e''^,e-^'^), or U = antidiag(e*'?^, e"^*^), where (/> G R. 

Proof. Since the matrices A and B are sparse, U has to be sparse as well, 
that is, U = diag(e*<^i, e*'^^) or C/ = antidiag(e*'^S e*'^^). 

Suppose that ^4 or i? is of the form e**^/, G R. Such a gate affects only 
a global phase change, and thus may be deleted. It follows from Lemma ^ 
that U is of the desired form. 
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Suppose that A and B are not multiples of the identity matrix. We 
may multiply A and B with global phase factors without changing the func- 
tionality of the circuit. Therefore, we can assume that A and B are of 
the form A = X"diag(l, e*'''''*) and B = diag(l, e*''^-B)X* for some a,b e 
{0, 1}, and < (f)A,4>B < 27r. Consider the four computational base states 
B = {|00), 1 01), 1 10), 1 11)} as inputs to our circuit. The circuit realizes a 
monomial matrix, because A and B are sparse. Therefore, ignoring or- 
der, the output of these four input states is given by a set of four states 
{q;|00),/3|01),7|10), 5|11)} with phase factors a,P,j,S. Since the circuit re- 
alizes a controlled-C/ operation with sparse U, the multiset of these four 
phase factors should be of the form Pu = {1,1, e^'f'^ , 6**^2 1, 

Notice that A and B each affect a phase change in exactly two of the 
four computational base states. During the evolution of a input state from 
B, the state might be multiplied by phases e**^-^ and e^'f'^ . If we record the 
combinatorial possibilities, then the multiset of phase factors {a, P, 7, 6} can 
be of the form: 

a) {e*'^'4^gj<}!>A^g*<^s^g«</'i3|^ provided A and B affect disjoint inputs, 

b) {l^e^<t>A^e^4>B^ei{4>A+<l>B)j^ provided a single input is affected by A and B, 

c) {1, 1, e'(«^'4+'^B), e^('^^+'^'B)}^ provided A, B affect the same two inputs. 

We compare these multisets with Pu to derive some constraints about U. 
It is clear that case a) cannot occur, because e*'^'*,e*'^^ 7^ 1. In case b), we 
necessarily have (pA = —(pB, therefore U is of the form U = diag(e*'^'*, e"**^-*) 
OT U = antidiag(e*'^^, e"*'''''*). In case c), it follows that U is of the form 
U = e'^I orU = e^^X with ^ = ^a + ^b-^ 

Remark 25 Suppose that a, unitary matrix U is of the form, U = e^'^X, 
U = e^'^I, U = diag(e*'?^,e-*'?^), or U = antidiag(e*'^, e"*"^). Notice that 

i) if ix{U) = 0, then U = e^'t'X, U = ±iZ, orU = antidiag(e^'^, e'^'t'), 

ii) if tr(C/) ^ 0, then U = e^'t'I, or U = diag(e*'^, e-*"^). 

The following lemma proves case g) oi Theorem B: 

Lemma 26 Suppose that U is a unitary 2x2 matrix. If ti(U) = 0, 
det(J7) / -1, [/ 7^ e^'f'X, and U / e'^Z, then m{U) = 4. 

Proof. Seeking a contradiction, we assume that m{U) < 3. The condition 
tr [7 = implies that U 7^ e**^/, i.e., at least one controlled- not gate is needed 
in the implementation. 
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Suppose that at least two controlled-not gates are used in the imple- 
mentation. This means that at most one single qubit gate can be used. 
Lemma 22 shows that U would have to be of the form U = e^'^X, contra- 



dicting the assumptions. Therefore, the potential implementation of U must 
have one controlled-not gate. 

Suppose now that one controlled-not gate and at most two single qubit 
gates A and B are used in the implementation. The matrices A and B are 



either both sparse or both not sparse by Lemma fL4 

Case 1. Assume that A and B are sparse. 
According to Lemma 
the form U 



Then U has to be sparse. 



p4| and Remark 25 the matrix U would have to be of 
U = ±iZ OT U = antidiag(e*'^,e-*'^). None of these 
matrices satisfies the assumptions of the lemma, contradiction. 

Case 2. Suppose now that A and B are not sparse. If the controlled-not 
gate has the same target bit as the controUed-C/ gate, then the circuit is of 
the form (^). Lemma ^ shows that A\ and A^ are sparse. It follows that 
the circuit has to be of the form 



A 



B 



This implies BA = e^'^l and BXA = e^'^'U , whence det U = —1, contradict- 
ing the assumptions. 

Assume now that target bit of the controlled-not gate is the most signif- 
icant bit. One easily sees that the two single qubit gates have to act on the 
most significant bit as well, i.e., the circuit is of the form 



A 



B 



HA 



H 



BH 



H 



It follows from Lemma ^ that HA and BH are sparse, whence A and B 
are both not sparse. Notice that |0), |1) are eigenvectors of U, say with 
eigenvalues uq, ai, respectively. Corollary ^ shows that BA = diag(l,ao)) 
and BXA = diag(l,ai). Comparing determinants shows that a2 = — ai- 
This implies that U is of the form U = diag(ai,a2) = aiZ, contradicting 
the assumptions. 

Therefore, we can conclude that m{U) > 4. We obtain m{U) = 4 with 
Lemma ^ □ 



We proceed with the proof of case h) of Theorem B. 
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Lemma 27 Let U be a unitary 2x2 matrix. If tv{UX) = 0, trC/ / 0, 
detU = 1, U ±1, then m{U) = 4. 

Proof. Seeking a contradiction, we assume that m{U) < 3. Lemma ^ shows 
that more than one controlled-not gate has to be used in the implementation 
of the controlled-C/ gate. We cannot have an implementation with three 
controlled-not gates, because the matrix corresponding to this circuit would 
have determinant —1. In the remaining case, one single qubit gate and two 
controlled-not gates are used for the implementation. Lemma ^ shows that 
a solution U with det [7 = 1 would have to be of the form U = ±7 or 
U = —X, all of which contradict the assumptions. We can conclude that 
m{U) > 4, hence m{U) = 4 by Lemma 21. □ 



The next lemma covers case of Theorem B. 

Lemma 28 Suppose that U is a unitary 2x2 matrix satisfying tr(C/) ^ 0, 
det U ^ 1, U e^'^I, and tic{UX) = 0. Then five elementary gates are 
necessary and sufficient to implement such a controlled-U gate. 

Proof. Seeking a contradiction, we assume that m{U) < 4. 

Case 1. Suppose that the implementation uses at least three controlled-not 
gates. According to Lemma ^ U would have to be of the form U = e^"^! 
or U = e^'^X, which contradicts the assumptions U ^ e**^ and ti{U) ^ 0. 

Case 2. Suppose that only one controlled-not gate is used in the implemen- 
tation of the controlled-C/ gate. It follows that tr [/ = by Lemma ^. This 
contradicts the assumption tr [7 7^ 0. 

Case 3. Suppose that two controlled-not gates and at most two single qubit 
gates A and B are used in the implementation of the controlled-C/ gate. 



We know from Lemma |l^ that A and B are either both sparse or both not 
sparse. 

Case 3.1. Suppose that A and B are both sparse. It follows from Lemma 
and Remark that U has to be of the form U = e^'^I or U = diag(e*'^, e"**^), 
which contradicts the assumptions U 7^ e^'^'I and det U ^ 1. 
Case 3.2. Suppose that A and B are both not sparse. We distinguish four 
different cases, depending on the target bit of the two controlled-not gates. 

Case ||. Suppose that the target bit of both controlled-not gates is the 
least significant bit. It follows from Lemma ^ and Lemma |^ that the single 
qubit gates have to act on the least significant bit as well. Assume without 
loss of generality that the control-f/ operation is implemented by a circuit 
of the form 
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u 



A 



B 



Comparing the result of the input |0)cgi|^/^) and ll)®]-;/') shows that e*^I = BA 
and e^^U = XBXA. Comparing determinants yields det?7 = 1, which 
contradicts our assumptions. 

Case It- Suppose that the most significant bit is the target bit of both 
controhed-not gates. There must be a single qubit gate, say B, on the most 
significant bit between the two controhed-not gates, because of Lemma ^. 
The other single quantum bit gate has to be on the most significant bit as 
weU, in order to map |00) to |00). 

Therefore, we may assume without loss of generality that the circuit is 
of the form 



A 



U 



B 



Notice that |0), |1) are eigenvectors of U, say with eigenvalues ao, ai, respec- 
tively. Corollary ^ shows that BA = diag(l, ao)) and XBXA = diag(l, ai). 
Comparing determinants shows that oq = ai. This implies that U is of the 
form U = diag(ao5ai) = ctol, contradicting the assumptions. 

Cases 11 and J,|. Finally, consider the case that the two controlled-not 
gates have different target bits. We may assume that the first controlled-not 
gate has the most significant bit as target bit. If this is not the case, the we 
simply consider the inverse circuit. The circuit is of the general form 



U 



C4 



Co 



C5 



C3 



HCi 



HCa 



C5H 



C3 



where two of the matrices Ci are given by A and B, and the remaining four 
are identity matrices. We use the circuit on the right hand side to derive a 
contradiction. 

Lemma ^ and Lemma ^ show that C^H ^ H, hence C5 7^ /. Conse- 
quently, at least one of the matrices Ci or C2 has to be the identity matrix. 
Therefore, HCi or C2H has to be nonsparse, whence C3 is nonsparse by 
Lemma ^. It follows that Ci = C2 = C4 = Cg = /. However, we know 



21 



from Lemma 10 that HCi and C2H cannot both be equal to H, thus it is 
impossible that Ci = C2 = /, contradiction. 

Therefore, we can conclude that it is impossible to implement a controlled- 
U gate with m[U) < 4 operations. It follows from Lemma ^ that m{U) = 5, 
which concludes the proof. □ 

It remains to show case j) of Theorem B: 

Lemma 29 Let U he a unitary 2x2 matrix. IfdetU = 1, trU ^ 0, and 
tT{UX) ^ 0, then m{U) = 5. 

Proof. It is not possible to implement such a controlled-C/ gate with only 
one controlled-not gate, cf. Lemma ^ If two controlled-not gates are used in 
the implementation, then Proposition |l^ shows that three additional single 
qubit gates are necessary. 

Assume that m{U) < 4 elementary gates are enough. If three or more 
controlled-not gates are used in the implementation, then at most one single 
qubit gate can be used. Lemma |2^ shows that U would have to be of the 
form U = e^'/'I 01 U = e^"^X, contradicting tr(C/X) / and tr [/ / 0. 

Therefore, m{U) > 5. It was shown in that m{U) < 5 when det [7=1, 
which proves the claim. □ 



4 Conclusions 

We have derived the minimal number of elementary gates that are necessary 
in any implementation of a controlled unitary gate. It would be interesting 
to know tight lower bounds for other fundamental constructions of quantum 
circuits. In particular, it would be nice to know the minimal number of 
elementary gates that are needed to realize doubly controlled-f/ gates, such 
as the Toffoli gate. 
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